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(57) Abstract: The invention refers to a method for identifying at least one binder molecule comprising the steps of: (a) choosing 
VO two amino acid types (AA1 and AA2) in a polypeptide or protein of interest, whereby AA2 at least once occurs directly subsequent 
to AA1 in the amino acid sequence of the polypeptide or protein, defining an amino acid pair AA1-AA2; (b) labelling the two 
amino acid types (AA1 and AA2) in the polypeptide or protein of interest, whereby all AA1 -residues is labelled with 13 C and all 
A A 2- re si dues with ,5 N; (c) generating a first HNCO-type NMR spectrum of the labelled polypeptide or protein from step (b), thereby 
identifying signals from the labelled amino acidpair AA1-AA2; (d) contacting the labelled polypeptide or protein with a potential 
binder molecule or a mixture of binder molecules under conditions and sufficient time for allowing binding of the potential binder 
molecule(s) and the labelled polypeptide or protein; (e) generating a second HNCO-type NMR spectrum, or a x H- l5 N correlation 
type NMR spectrum, of the mix from step (d), monitoring signals identified in step (c); (f) comparing the first and the second NMR 
J^. spectra, whereby a chemical shift change of the signals identified in step (c) between the two spectra indicates an interaction between 
the potential binder molecule and the labelled polypeptide or protein. 
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New NMR screening method 

Technical field 

5 The present invention relates to a nuclear magnetic resonance (NMR) based method 
for assaying binding of various chemical compounds to a polypeptide or a protein. 
More specifically, the method allows for the screening of binding to a designated 
binding epitope on the surface of the polypeptide or protein. 

0 Technical background 

In modern biology and medicine, there has been a demand for physical methods, 
which can make it possible to study the structure of small and large biomolecules, as 
well as the interaction between various molecules and compounds. For this purpose. 
5 several powerful techniques have been applied, such as x-ray crystallography, mass 
spectrometry, and nuclear magnetic resonance (NMR). 

Tn recent years, NMR spectroscopy has become an important tool in the drug dis- 
covery process through the advent of NMR based screening methods to identify lead 

!0 templates. Several techniques have been introduced. [Shuker, 1996; Chen. 1998; 

Mayer, J 999; Chen, 2000; Jahnke, 2000] perhaps the most well known is the "SAR 
by NMR" method described by Fesik and coworkers in 1996. [W097/1 847 1 , Shu- 
ker, 1996 and EP-B 1-0866967] The SA.R by NMR technique relies on detecting 
chemical shift changes in a two-dimensional 'H- 15 N correlation spectrum to identify 

>5 compounds that bind to the target protein. When a first ligand has been identified, a 
second ligand is sought for in the presence of saturating concentrations of the first 
ligand (screen for "second-site" binder). Provided the three dimensional structure of 
the protein is known and sequence specific NMR assignments of the protein back- 
bone resonances have been obtained, the two ligands may be linked based on struc- 

50 tural data. To first approximation the binding affinity of the linked compound will 
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be the product of the binding affinities of the individual compounds, resulting in 
high affinity "nanomolar binders" even if the starting compounds only had millimo- 
larto micromolar affinities. [Shuker, 1996] 

5 A prerequisite for the SAR by NMK method is that sequence specific resonance as- 

15 13 13 

signments have been obtained for the backbone NMR resonances ( N, C a . J C\ 
! H N ) of the target protein. This is a formidable task that demands several months of 
experimental work, and data analysis even for a relatively small protein. 

0 For structural studies of large proteins, Yabuki et al., 1998 5 discloses a method, 
which method involves the site-specific labelling of two in the peptide sequence 
following amino acids with 13C and 15N 5 respectively. This is done in a so called 
cell-free system, in order to avoid problems of disturbing incorporations. Hereby, 
due to 1 Jorcoupling it is possible to study only one signal in an adequate spectrum. 

5 This article is mainly focused on the development of methods for cell-free synthesis 
for structural studies of the human c-Ha-Ras protein, as well as protein-protein in- 
teractions. 

Accordingly, the known techniques for screening for small binder molecules to a 
0 specific site in a target protein, such as an active site, pose some disadvantages, in 
that they are time-consuming and involve a lot of experimental work as well as 
complex interpretation of data achieved. Thus, there is a need for a method allowing 
identification of binder molecules to a specific target in an easier and more effective 
way, thereby limiting the experimental work. 

5 

The object of the invention is to reduce the drawbacks and expand the possibilities 
of the prior art. 
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Summary of the invention 

This object is, according to the invention, solved by a method for identifying at least 

one binder molecule comprising the steps of: 
5 (a) choosing two amino acid types (AA1 and AA2) in a polypeptide or protein of 

interest, whereby AA2 at least once occurs directly subsequent to AA1 in the 
amino acid sequence of the polypeptide or protein, defining an amino acid 
pair AAI-AA2; 

(b) labelling the two amino acid types (AA1 and AA2) in the polypeptide or pro- 
10 tein of interest whereby all AA1 -residues is labelled with C and all AA2- 

residues with ,5 N; 

(c) generating a first HNCO-type NMR spectrum of the labelled polypeptide or 
protein from step (b) 3 thereby identifying signals from the labelled amino acid 
pair AA1-AA2; 

15 (d) contacting the labelled polypeptide or protein with a potential binder mole- 

cule or a mixture of binder molecules under conditions and sufficient time for 
allowing binding of the potential binder molecule(s) and the labelled poly- 
peptide or protein; 

(e) generating a second HNCO-type NMR spectrum, or a 'H-^N correlation type 
20 NMR spectrum, of the mix from step (d). monitoring signals identified in step 

(c); 

(f) comparing the first and the second NMR spectra, whereby a chemical shift 
change of the signals identified in step (c) between the two spectra indicates 
an interaction between the potential binder molecule and the labelled poly- 

25 peptide or protein. 

Hereby, by using efforts from the structural studies of large proteins in the field of 
drug molecule screening, the inventors have unexpectedly provided a method, which 
allows an easy identification of binder molecules to a target molecule. 

30 
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Yet another aspect of the invention is a method, whereby the labelled amino acid 
pair AA1-AA2 is unique within a sphere radius of 10 A, preferably 50 A, and most 
preferably within the whole polypeptide or protein. 

5 One embodiment of the invention is a method, whereby the labelled amino acid pair 
AA1-AA2 is within a binding pocket of the polypeptide or protein. This allows for 
the screening for binding to a single binding site. The requirement is that a unique 
amino acid pair occurs in the binding pocket. 

0 Another embodiment of the invention is a method, whereby the labelled amino acid 
pair AA1-AA2 is in the proximity, preferably closer than 15 A, of an active site 
within the polypeptide or protein. This enables the screening for specificity in vicin- 
ity of an active site in the polypeptide or protein. If the target protein has one or 
more unique amino acid pairs in the vicinity of an active site, whereby the unique 

5 amino acid pairs differs from other members of the same protein family, this can be 
used to screen for selectivity, i.e. the different proteins of the same family can be 
compared. Different compounds may give different chemical shift, why the longest 
distance from the active site, which makes specificity screening possible, varies. 

0 Yet another embodiment of the invention is a method, whereby the result of the 

method is compared to the result of any other suitable binding or activity assay, such 
as a fluorescence-based assay, a reporter gene assay, displacement assays or ELISA. 
This allows for a rapid confirmation of the binding to this specifically labelled site, 
or as a selection of candidates for more extensive study by any other suitable 

5 method. 

Accordingly, according to another aspect of the invention, the method is used for 
screening of a compound library. 
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Short description of the drawings 

Figure 1; Site specific labelling of PTP1B. All arginine and aspartate residues have 
been enriched with l3 C and 15 N, respectively. There is only one unique Arg-Asp pair in 
the sequence (boxed). 

Figure 2; 2D NMR spectra of 15 N-Asp, ,3 C-Arg labeled PTP1B. A) 2D TROSY of free 
protein, B) 2D HNCO of free protein, C-F) 2D TROSY spectra of C) free protein, D) 
protein + compound cocktail, E) protein + compound cocktail omitting PNU 179983 
and F) protein + PNU1 79983. 

Figure 3; Ribbon drawing of PTPlb showing the five selectively labelled amino acid 
residue pairs. The " Y-Ioop pairs" Arg47-Asp48 (I) and Arg43-Asn44 (II) are shown, 
as well as Argl56-Glnl57 (III), Arg 169-Glul70 (IV) and Argl99-Glu200 (V), and 
the active site cysteine (VI). 

Figure 4; All compounds were tested in a colorimetric assay. Samples containing I 
HM PTP1B, 2 mM para-nitrophenylphosphate (substrate) and 1 mM compound were 
prepared in 20 mM Tris-HCl buffer at pH 7.5 and incubated at room temperature for 5 
minutes. The reaction was stopped by addition of NaOH to raise the pH. The amount 
of inhibition was calculated from the measured absorbance of the resulting product 
(para-nitrophenole). 

Figure 5; Stereo-view of PNU 179983 binding to PTP1B as determined from a 
preliminary x-ray crystallographic electron density map obtained at 2.2 A resolution. 
PNU 179983 clearly interact not only with the active residues but also have close 
contacts with the Y-loop (highlighted) via hydrogen bonds. The closest distance 
between PNU 179983 ant the Y-loop is approximately 2.8 A. The crystals were grown 
in the presence of 2 mM inhibitor in 0.1 M pH 6.5 cacodylate buffer, 0.2 M Mg 
acetate, 16-20 % PEG 8000. The crystals belonged to space group P222 with cell 
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dimensions a=53.2 A, b=84.3 A, c=88.7 A, alpha=beta=gamma=90. Data extended to 
2.2 A with Rsym=6.7%. Data was processed and the model refined to R=25%. 

Figure 6 is a schematic view of one embodiment of the invention (screening for 
binding to a single binding site). Application I: screen for binding to single binding 
site. 

Figure 7 is a schematic view of another embodiment of the invention (screening for 
specificity in vicinity of an active site). Application II: screen for specificity in vicinity 
of active site 

Figure 8; NMR spectra recorded on a 200 \xM 13 C-Val 5 l5 N-Ala selectively labeled 
sample of M-FABP. The spectra were recorded at 20 °C on a 600 MHz Varian Unity 
INOVA NMR spectrometer using standard techniques. The sample was prepared in 20 
mM sodium phosphate buffer (10 % D 2 O/90 % H 2 0) pH 7.4 containing 50 mM NaCl, 
10 mM DTT and 0.02 % NaN 3 . NmrPipe (Delaglio et al., 1995) was used for data 
processing, (a) 2D 'H- 15 N fast-HSQC (Mori et al., 1 995), (b) 2D HNCO and (c) I D 
HNCO. The HNCO spectra in panels b and c identify the resonance corresponding to 
alanine 33 (Roman numeral II in panel a). Experimental times were 1 h 5 4h ? and 20 
min respectively. 

Figure 9; Plots of 'H-^N fHSQC spectra recorded at 600 MHz showing the region 
corresponding to the alanine 33 cross peak of Figure 8a. Experimental conditions were 
identical to those described in Figure 8 except the samples used for spectra b-d 
contained additionally 1 % d-DMSO. (a-d) Reference spectrum of free M-FABR. (b) 
1:1 mixture of M-FABP and compound cocktail, (c) 1:1 mixture M-FABP and 
compound cocktail omitting oleic acid, (d) 1:1 mixture of M-FABP and oleic acid. 
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Detailed description of the invention 

By a "compound library" is meant a set of chemical compounds, that for example 
can be used for assaying binding to a macromolecule. The compound library may 
5 comprise from about 2 - 100.000 compounds, preferably 100-1000. The compounds 
of the library can be of any size, preferably 50-1000 D. The compounds can be any 
organic molecules or natural products. 

By a "polypeptide or protein of interest'* is meant any amino acid sequence, or a 
10 complex of amino acid sequences, having a potential binding epitope. 

By a "potential binder molecule" is meant a chemical compound, which may bind to 
the polypeptide or protein of interest, and which may be a member of the compound 
library. The potential binder molecule may be a peptide, a polypeptide, a protein, an 
15 antibody, a nucleic acid molecule, a carbohydrate, or a part or a complex of one or 
more of the mentioned molecule types, or any other chemical compound of interest. 
Preferably, the potential binder molecule is a relatively small molecule, such as a 
molecule in the size interval of 50-1000 D. 

20 "AA1 V and "AA2" can be any one of the about 20 different naturally occurring 

amino acid types. Furthermore, AA1 and/or AA2 may also be any modified variant 
of an amino acid, as long as it is possible to detect the 1 J CN »-coupling. AA1 and AA2 
may be the same or different type. 

25 By AA2 at least once occurs "directly subsequent" to AA1 in the amino acid se- 
quence is meant that AA1 in this case has the sequence position n-1 and AA2 se- 
quence position n. 



SUBSTITUTE SHEET (RULE 26) 



WO 02/33406 



8 



PCT/SE01/02281 



A "AAl-AA2-pair" is a pair of amino acids following each other directly in se- 
quence, whereby AA1 has the number n-1 and AA2 the number n in the amino acid 
sequence. 

5 By a :; binding pocket" is meant an epitope on the polypeptide or protein of interest, 
that is expected to allow binding of a potential binder molecule. 

By a ki sphere radius 1 ' in this context is meant a defined distance in any direction in 
space from a defined pair of amino acids (AA1-AA2). 

10 . 

The core of the invention is a revitalization of a sequence specific labelling method 
that has not been extensively used. [Kainosho, 1982, Kato 3 1991, Yabuku 1991] All 
amino acids AA1 are labelled with I3 C and all amino acids AA2 are labelled with 
15 N. Provided only one AA1-AA2 pair.occurs in the amino acid sequence, only one 
15 signal in the ID carbonyl 13 C spectrum will display a splitting due to the ^cn cou- 
pling. Obviously only one peak will appear in a ID (or 2D or 3D) HNCO type cor- 
relation spectrum (Figure 1). 

The labelling strategy is only sequence specific in an indirect sense, i.e. the occur- 
20 rence of a unique pair of labelled amino acid residues confers the sequence speci- 
ficity. Using this technique it is possible to screen selectively for binding to a se- 
lected epitope without the need for sequence specific assignments. The HNCO 
spectrum can thus be used either directly as a screening experiment (ID or 2D ver- 
sions) or indirectly to identify what signals to monitor in a 2D *H- 15 N correlation 
25 spectrum. Chemical shift perturbations upon addition of a potential ligand are easily 
detected even for large proteins due to the reduced spectral complexity resulting 
from the use of a selectively labelled sample. 

The site specificity is inherent to the labelling technique and the key is to find a 
30 unique sequence motif in the binding site that is to be screened. Given a protein of 
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300 amino acid residues, and assuming equal and random distribution of all 20 
amino acid residues, the probability that a given amino acid residue pair only occurs 
once in the sequence is approximately 0.5. If the desired site contains at least 3-4 
suitable pairs, the probability of finding a unique pair is reasonably high, Le. 85-95 
5 %. 

The most reliable way to obtain a sequence specific labelled protein is to over- 
express the protein using cell-free synthesis as described by Yabuki et al. [Yabuki, 
1998]. Alternatively the protein could be over-expressed in rich medium containing 
.0 the labelled amino acids, using a bacterial strain with gene lesions suitable for the 
particular type of selective amino acid enrichment [Muchmore, 1989]. In favorable 
cases however, good results have been reported using a prototrophic bacterial strain 
in a rich medium containing all amino acids and nucleotides. [Kainosho, 1982] 
[Muchmore, 1989] 

15 

According to one embodiment of the invention, the method can be used for screen- 
ing for binding to a single binding site (figure 6). This can be done if the target pro- 
tein or polypeptide of interest has at least one unique amino acid pair (two amino 
acids being adjacent to each other in the primary structure) within the potential 
10 binding site. Then, this unique amino acid pair is labelled according to the method 
of the invention, and the interaction between the binding site and potential binder 
molecules can be studied. 

According to another embodiment of the invention, the method can be used for 
25 screening for specificity in vicinity of an active site (figure 7). This can be achieved 

if the target protein or polypeptide of interest has at least one unique amino acid pair 
* in vicinity of the active site (preferably closer than 1 5 A ; and more preferably within 

5-15 A). Further, if the target protein differs from other proteins within the same 

protein family in that the target protein has at least one unique amino acid compared 
30 to the other proteins, this can be used to screen for selectivity of potential binder 
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molecules to the target protein. This means that the unique amino acid, which differs ■ 
from the other proteins of the family, may be close to the active site with regard to 
tertiary structure, but not necessarily close with regard to primary structure. How- 
ever, the unique amino acid should preferably not be within the active site. 

5 

According to yet another embodiment of the invention, the method of the invention 
can be used in combination with any other suitable binding or activity assay, such as 
a fluorescence based assay, a reporter gene assay, displacement assays or ELISA, in 
order to either confirm the result of that method, or to screen for interesting com- 
10 pounds, which subsequently are more thoroughly studied by another method. The 
method of the invention can be advantageously used in this aspect as it provides a 
method, which rapidly gives a reliable result. This can be useful when there are no 
X-ray data for the studied protein, but only computer modelling data. 

1 5 The conditions suitable for allowing the potential binder molecule and the labelled 
protein or polypeptide of interest to interact, as well as to monitor the interaction by 
NMR, are standard conditions for protein NMR [Cavanagh] (buffered solutions, pH 
kept stable, reaction temperature 5-50 °C). Preferably, the target protein concentra- 
tion may range from 25 pM - 1 mM, and the potential binder molecule concentra- 

20 tion from 25 pM - 1 mM. 

In order for a change in the chemical shift to be considered relevant, the change 
should be equal to or greater than the natural line width, or the signals should be ex- 
change-broadened beyond detection. 

25 

Moreover, the method of the invention may be used for competition binding analysis 
experiments. 

To summarise, the new technique presents a new approach to screen and identify 
30 binders to protein targets in a site-specific manner. This will allow the identification 
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of scaffolds (by screening a small compound library) with a binding preference for a 
particular site or sites, which could confer specificity for a certain target within a 
protein family, and also confirm the binding mode from hits detected using other 
suitable binding or activity assays, such as a fluorescence based assay, a reporter 
5 gene assay, displacement assays or ELISA. 

Also, the new technique has the potential to allow NMR studies of proteins of much 
larger size than traditional SAR-BY-NMR since essentially no assignment of the 
protein resonances has to be undertaken. Conventionally, only proteins of the size 
10 up to approximately 30 kD are possible to study. With the method of the invention, 
proteins of the size 50 kD 3 and most probably of the size up to 1 00 kD, can be 
monitored. The sample preparation can be prepared according to previously de- 
scribed protocols. [Muchmore 1989, Yabuki 1998] 

15 For further details on NMR spectroscopy or on experimental approaches relevant in 
the field of the invention, W097/18471 and Cavanagh, 1996 are hereby incorpo- 
rated as references. 

Below, the invention is described by the following examples, which only are to be 
20 seen as examplifying the invention, and not limiting the scope of the invention in 
any way. 

Examples 

25 Example L: Screening for binding to the Y-loop of Protein Tyrosine Phosphatase- IB 
(PTP1B) (SEQ ID NO:l). 

PTP1B (35 kD, single domain protein] is a protein that dephosphorylates phosphoty- 
rosines. The active site binding cleft is centered around an active cysteine residue. It 
30 has been speculated that the so-called Y-loop of PTP1B is important for binding of 
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some ligands. The Y-loop contains a sequence motif that is unique in the sequence, 
i.e. Arg47-Asp48 (figure 1). A selective labelled sample of PTP1B was prepared to 
monitor binding to this site. 

5 Site specific labelled PTP1B (residues 1-298) was prepared from transformed Es- 
cherichia coli strain BL21(DE3) cells. Bacteria were grown in rich medium con- 
taining all 20 amino acids and 5 nucleotides according to the protocol described by 
Muchmore et al. [Muchmore, 1989] Aspartate and arginine were supplied l3 N- 
enriched and ^C-enriched, respectively. It should be noted that this protocol (i.e. the 

10 use of a prototrophic bacterial strain) for preparing the sample is not optimal for the 
production of selectively 1:> N-Asp enriched protein. Aminotransferase activity may 
cause misincorporation of I:> N in other amino acid residues. For the case of Asp, in- 
corporation in Asn, Glu and Gin residues may be seen, [Muchmore, 1989] Never- 
theless, the risk of misincorporation of the 55 N label is in no way a limitation to the 

15 invention. A selectively labelled protein may be produced with no misincorporation 
of the I5 N (or 13 C) label if a bacterial strain with gene lesions suitable to the par- 
ticular type of selective amino acid enrichment is used. [Muchmore, 1989] Alterna- 
tively, the sample could be prepared using cell-free synthesis [Yabuki, 1998]. It 
should also be noted that in favorable cases good results have been reported also 

20 when prototrophic bacterial strains were used. [Kainosho, 1982][Muchmore 5 1989] 

A two-dimensional l H- l3 G correlation spectrum of the selectively labelled sample 

1 3 113 

verified that only Arg residues were J C-enriched. The corresponding H- N ex- 
periment indicated, however, that 15 N was not exclusively incorporated in Asp resi- 

25 dues. Figure 2A shows the 2D ! H- l5 N TROSY [Weigelt, 1998] spectrum of the !5 N- 
Asp, 13 C-Arg sample. If only Asp residues were Lv N-labelled a total of 18 peaks 
would be expected in this ! H- l:, N correlation spectrum (18 Asp residues in the 
amino acid sequence). Instead the spectrum contains approximately 65 peaks, which 
is consistent with incorporation of ,V N also at Asn ? Glu and Gin sites (totally 69 in 

30 the amino acid sequence). Studying the amino acid sequence in detail one can see 
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that there now instead of one unique pair, Arg47-Arg48, are four additional ones, 
Arg43-Asn44 5 Argl56-Glnl57 ; Argl69-GluL70. and Argl99-Glu200. The first of 
these, Arg43-Asn44, is located in the beginning of the Y-loop. The other three are 
situated on the protein surface far from the binding site (figure 3). 

5 

Figure 2B shows the 2D HNCO [Cavanagh, 1996] spectrum that selects signals 
from the selectively labelled pairs. As expected, five peaks corresponding to the five 
unique sequence pairs appear in the spectrum (circled). An additional peak (marked 
with an arrow) originating from the C-terminal Asp residue is also seen. The natural 
10 abundance ,3 C (1 %) of the preceding Glu residue is enough to yield an HNCO sig- 
nal due to the extremely strong H^-resonance of the C-terminal Asp residue. 

To detect binding to the Y-loop one would expect that one or two of the designated 
signals would experience large chemical shift changes upon addition of a compound 
15 that interacts with the Y-loop 5 whereas the other three signals would remain essen- 
tially unaffected. 

To test this hypothesis the inventors recorded 2D TROSY spectra in the presence of 
a cocktail of five compounds. One of the compounds, N200, was known to bind to 

20 the protein as determined by a NMR-line broadening assay (more than 5 Hz line 
broadening at 50 \xM concentration in the presence of an equimolar amount of 
PTP1B). Three compounds, N35, N136 and N212. were non-binders as determined 
by NMR. The fifth compound, PNU 179983, was a micromolar inhibitor of PTP1B 
believed to interact with the Y-loop. Figure 2C shows a selected area from the 

25 TROSY spectrum of free l5 N-Asp, )3 C-Arg PTP1B. Figure 2D shows the spectrum 
in presence of the cocktail. Figure 2E shows the spectrum in presence of all com- 
pounds from the cocktail except PNU 179983. Figure 2F shows the spectrum with 
only PNU179983 present. In all spectra the signals corresponding to the HNCO se- 
lected signals are circled. 

30 
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It is evident that substantial spectral changes occur in the presence of the compound 
cocktail (figure 2D). In particular two of the designated peaks (marked with aster- 
isks) disappear altogether, whereas the other three signals only experience minor 
shifts. This indicates that at least one of the compounds in the cocktail interacts 
5 strongly with the Y-loop. When PNU 179983 was omitted from the cocktail (Figure 
2E) no shifts are observed for any of the designated signals. Figure 2F shows that 
PNU 179983 is responsible for the large shifts since this spectrum shows the same 
pattern as the one recorded in presence of the full cocktail (figure 2D). 

10 To further substantiate the finding that the invention allows identification of binders 
to a selected site all compounds were tested in a colorimetric assay (figure 4). N 
200, which binds to PTP1B according to the NMR line-broadening assay, showed 
partial inhibition of the enzyme, indicating binding to the active site. N35 and N136 
also showed partial inhibition, which would indicate that they were false negatives 

15 or interacting too weakly to indicate binding in the NMR line-broadening assay. 
PNU 179983 showed complete inhibition of the enzyme. 

Furthermore, an x-ray crystallographic study was performed to assess the binding 
mode of PNU179983. Preliminary x-ray crystallographic data at 2.2 A resolution 
20 (Derek Ogg, unpublished results) showed that PNU179983 binds to PTP1B with 
interactions not only within the active site but also with the Y-loop (figure 5). The 
inhibitor is in close proximity to the Y-loop and interacts with the loop via hydrogen 
bonds. The shortest distance between the inhibitor and the Y-loop is approximately 
2.8 A. 

25 

Example 2: Site selective screening of human muscle fatty acid binding protein (M- 
FABPXSEQ ID NO:2). 
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The principle of the site-selective screening method is demonstrated for the human 
muscle fatty acid binding protein M-FABP. A 1.4 A X-ray crystal structure of M- 
FABP in complex with a ligand, oleic acid, is available (PDB ID code: 1 HMS). 
[Young, 1994] The structure consists of 10 anti-parallel P-strands and two ot-helices 

5 that connect p-strands 1 and 2. The P-strands form two nearly orthogonal P-sheets. 
The fatty acid binding site is situated in a cavity between the two P-sheets. The cav- 
ity is also lined by residues from helices 1 and 2. The carboxylate group of the fatty 
acid forms hydrogen bonds (direct and solvent mediated) to the protein. The ali- 
phatic tail of the fatty acid adopts a u-shaped conformation in the highly hydropho- 

10 bic cavity. Alanine 33 makes contacts with carbons C12 5 C13 and C 14 of the oleic 
acid. [Young, 1994] Valine 32 and alanine 33 comprise a unique amino acid residue 
pair in the amino acid sequence, and are thus suitable for selective labeling. It is 
likely that the NMR resonances of alanine 33 will be affected upon binding of oleic 
acid. A 13C'-Val, 15N-Ala selectively labeled sample of M-FABP was produced 

1 5 from prototrophic Escherichia coli cells harboring a plasmid containing the gene 
coding for a 143 amino acid protein construct of M-FABP. The construct encom- 
passed a C-terminal his-tag included for purification purposes. Cells were grown in 
a fermentor using a growth medium containing all amino acids and nucleotides ac- 
cording to the protocol of Muchmore et al. Valine was supplied with the carboxyl 

20 carbon 13C-labeled, alanine was supplied 15N labeled (Cambridge Isotope Labora- 
tories). Muchmore et al. advocates the use of auxotrophic bacterial strains with gene 
deletions suitable for the particular type of amino acid labeling that is desired. Even 
better performance is expected from cell-free synthesis [Yabuki, 1998], Good re- 
sults have, however, been reported also when prototrophic bacterial strains were 

25 used. [Kainosho, 1982; Kato, 199 L; Muchmore, 1989; Senn, 1987] The protein was 
purified by affinity cromatography using a Ni2+ charged 5 ml HiTrap Chelating 
column (Amersham Pharmacia Biotech), followed by a Lipidex-1000 column 
(Sigma) as described by Constantine et al. 
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The protein mass was determined using Electrospray mass spectrometry. The extent 
of incorporation of 13C and 15N in valine and alanine residues was estimated to be 
at least 95 %. 

5 Figure 8 shows NMR spectra of selectively labeled M-FAJBP. The signals visible in 
a 1H-15N correlation experiment (Figure 8a) correspond perfectly to the alanine 
resonances of M-FABP. [Constantine, 1998] The 2D HNCO experiment yields 
cross peaks only for 13C-15N-IH moieties. There is only one such instance in the 
selectively labeled M-FABP protein, namely f I3C)Val32-(15N)Ala33. The HNCO 
10 spectrum in Figure 8b contains only one cross peak which readily identifies the 

resonance corresponding to alanine 33, in agreement with the published resonance 
assignment. The spectrum displayed in Figure 8c demonstrates the feasibility of the 
use of a ID HNCO experiment for detection. 

15 Five test compounds - oleic acid and four compounds from our NMR screening li- 
brary showing no NMR line broadening when mixed with M-FABP - were mixed 
into a compound cocktail. Figure 9. displays spectral expansions of spectra recorded 
on selectively labeled M-FABP in different mixtures. 

20 As expected the HSQC cross peak belonging to alanine 33 experiences a chemical 
shift change upon addition of the test cocktail (Figure 9b)> The same chemical shift 
change is observed when M-FABP is mixed with oleic acid alone (Figure 9d). 
Omitting oleic acid from the test cocktail leaves the spectrum unchanged (Figure 
9c). Clearly the site-selective labeling method promises to be a valuable tool for 

25 identifying compounds with specific binding properties. Potential applications in- 
clude: screening for binders to a selected site that has been identified either through 
x-ray. NMR or modeling studies: an assay to confirm binding of ligands identified 
by other methods (i. e. HTS) to a desired site; screening for ligands that bind to a 
site that could confer binding specificity for one target protein within a protein fam- 

30 ily. Due to the reduced spectral complexity resulting from the use of a selectively 
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labeled sample, the method should be applicable to larger proteins than are conven- 
tional methods. 
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Claims: 

1. Method for identifying at least one binder molecule comprising the steps of: 

(a) choosing two amino acid types (AAl and AA2) in a polypeptide or protein of 
5 interest, whereby AA2 at least once occurs directly subsequent to AAl in the 

amino acid sequence of the polypeptide or protein, defining an amino acid 
pair AA1-AA2; 

(b) labelling the two amino acid types (AAl and AA2) in the polypeptide or pro- 
tein of interest, whereby all AAl-residues is labelled with ,J C and all AA2- 

0 residues with l5 N; 

(c) generating a first HNCO-type NMR spectrum of the labelled polypeptide or 
protein from step (b). thereby identifying signals from the labelled amino acid 
pair AA1-AA2; 

(d) contacting the labelled polypeptide or protein with a potential binder mole- 

.5 cule or a mixture of binder molecules under conditions and sufficient time for 

allowing binding of the potential binder molecule(s) and the labelled poly- 
peptide or protein; 

(e) generating a second HNCO-type NMR spectrum, or a l H- l5 N correlation type 
NMR spectrum, of the mix from step (d), monitoring signals identified in step 

M (c); 

(£) comparing the first and the second NMR spectra, whereby a chemical shift 
change of the signals identified in step (c) between the two spectra indicates 
an interaction between the potential binder molecule and the labelled poly- 
peptide or protein. 

15 

2. Method according to claim 1. whereby the labelled amino acid pair AAl -AA2 is 
unique within a sphere radius of 10 A, preferably 50 A, and most preferably within 
the whole polypeptide or protein. 
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3. Method according to anyone of claims 1-2, whereby the labelled amino acid pair 
AA1-AA2 is within a binding pocket of the polypeptide or protein. 

4. Method according to anyone of claims 1-2, whereby the labelled amino acid pair 
AA1-AA2 is in the proximity of an active site within the polypeptide or protein. 

5. Method according to anyone of claims 1-2. whereby the result of the method is 
compared to the result of any other suitable binding or activity assay. 

6. Method according to any of the preceding claims, whereby the polypeptide or 
protein has a size of 10 - 150 kD. 

7. Method according to any of the preceding claims, whereby the potential binder 
molecule is a peptide, a polypeptide, a protein, an antibody, a nucleic acid molecule, 
a carbohydrate, or a complex of one or more of the mentioned molecule types. 

8. Method according to any of the preceding claims, whereby the potential binder 
molecule is a relatively small molecule, preferably in the size interval of 50-1000 D. 

9. Method according to anyone of the preceding claims, whereby the method is used 
for screening a compound library. 
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Figure 3 
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Figure 4. 
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